Physica A: Statistical Mechanics and its Applications
○ Elsevier BV
Preprints posted in the last 90 days, ranked by how well they match Physica A: Statistical Mechanics and its Applications's content profile, based on 10 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Pachter, L.
Show abstract
We introduce a spectral existence criterion for the evolution of cooperation in the form of the inequality{lambda} maxb > c, where{lambda} max is the leading eigenvalue of an interaction operator encoding population structure, and b and c represent benefit and cost tradeoffs, respectively. Nowaks five rules for the evolution of cooperation correspond to cases in which the cooperation condition reduces to a scalar assortment coefficient. These results follow from the Price equation, which sheds light on a long-standing debate on the role of inclusive fitness and evolutionary dynamics in explaining the evolution of cooperation.
Khilar, S.; Natarajan, E.
Show abstract
Predicting protein-ligand interactions in the modern drug discovery has revolved from the involvement of artificial intelligence and structural bioinformatics using Graph Neural Networks (GNNs). The limited explainability of GNN models presents an important encumbrance in biomedical research, but it has achieved a high degree of accuracy in determining and identifying binding affinity and active compounds, as evidenced by [1] [2] [3] [4]. Here this research focuses on the interpretation of protein-ligand interactions at a molecular level, a rapidly developing area within Graph Neural Networks (GNNs). Now days modern study handling techniques such as visualization techniques, attention mechanism and model-based feature ascription by model to boost, and make robust and decrease false predictions on binding. Along with some approaches include like graph pooling strategies, message-passing optimization, self-supervised learning, transfer learning and contrastive learning are rapidly utilized to enhance the representative learnings. Furthermore, integration of molecular docking simulations, hybrid deep learning architectures and protein language model gives more reliable & biological predictions of protein-ligand interactions. That focuses on given process that identifies key ligand atoms and binding residues, as well as physicochemical factors influencing affinity, through chemical thought processes. Here this research work identified the challenges of developing biologically significant explanations, transparency, and the corollary dataset biases on interpretability. The research work conducted an in-depth investigation into the consolidation of protein language models to establish more reliable pathways for future research, examining hybrid architectures, transparent and energy-efficient GNNs, and scientifically grounded AI models for drug discovery. My research work highlights that XGNNs establishes a connection between Deep Learning and Biochemical expertise with increased confidence, which will enhance the accuracy of predictive models and computational models.
Sukekawa, T.; Ei, S.-I.
Show abstract
Mass-conserved reaction-diffusion systems are used as mathematical models for various phenomena such as cell polarity. Numerical simulations of this system present transient dynamics in which multiple stripe patterns converge to spatially monotonic patterns. Previous studies indicated that the transient dynamics are driven by a mass conservation law and by variations in the amount of substance contained in each pattern, which we refer to as "pattern flux". However, it is challenging to mathematically investigate these pattern dynamics. In this study, we introduce a reaction-diffusion compartment model to investigate the pattern dynamics in view of the conservation law and the pattern flux. This model is defined on multiple intervals (compartments), and diffusive couplings are imposed on each boundary of the compartments. Corresponding to the transient dynamics in the original system, we consider the dynamics around stripe patterns in the compartment model. We derive ordinary differential equations describing the pattern dynamics of the compartment model and analyze the existence and stability of equilibria for the reduced ODE with respect to the boundary parameters. For a specific parameter setting, we obtained results consistent with previous studies. Moreover, we present that the stripe patterns in the compartment model are potentially stabilized by changing the parameter, which is not observed in the original system. We expect that the methodology developed in this paper is extendable to various directions, such as membrane-induced pattern control.
Kuznetsov, A. V.
Show abstract
Alzheimers disease (AD) is characterized by the accumulation of amyloid-{beta} (A{beta}), yet the specific link between plaque burden and cognitive decline remains a subject of intense investigation. This paper presents a mathematical model that simulates the coupled dynamics of A{beta} monomers, soluble oligomers, and fibrillar species in the brain tissue. By modifying existing moment equations to include a dedicated conservation equation for A{beta} monomers, the model explores how various microscopic processes, such as primary nucleation, surface-catalyzed secondary nucleation, fibril elongation, and fragmentation, contribute to macroscopic disease progression. Central to this study is the concept of "accumulated neurotoxicity" as a surrogate marker of biological age, defined as the time-integrated concentration of soluble A{beta} oligomers. Unlike plaque burden, accumulated neurotoxicity cannot be reversed, and the harm it causes depends critically on the sequence of events that produced it. Numerical results demonstrate that while plaque burden and neurotoxicity both increase over time, their relationship is non-linear and highly sensitive to the efficiency of protein degradation machinery. Specifically, impaired degradation leads to a rapid advancement of biological age relative to calendar age. The model further identifies oligomer dissociation and fibril fragmentation as potential protective mechanisms that can counterintuitively reduce neurotoxic burden by diverting monomers away from the soluble oligomer pool. These findings provide a quantitative framework for understanding why individuals with similar plaque burdens may experience vastly different cognitive outcomes, underscoring the importance of targeting soluble oligomers early in therapeutic interventions.
Saez, M.; Minas, G.; Camacho-Aguilar, E.; Rand, D. A.
Show abstract
During embryogenesis, as cells proliferate and assemble into tissues, they undergo a sequence of transitions between distinct molecular states eventually giving rise to a cellular population consisting of an appropriate distribution of specific functional cell types. Recent progress on the dynamics underlying decision-making in developmental landscape makes it feasible to start analysing the amount of information involved in constructing such systems. To explore this we introduce the notion of potency of a developmental landscape and attempt to calculate it for two development systems of current interest, in-vitro differentiation of epiblast-like cells into neural and mesodermal progenitors and the worm vulva patterning system. Our approach integrates concepts from developmental biology, information theory and dynamical systems to estimate both the number and identity of signalling regimes that give rise to distinguishable temporal response patterns.
Garay, J.; Mori, T. F.
Show abstract
Price equation and genotype dynamics are two methods for studying the fixation of one allele by natural selection in a diploid population. There are two strict monotonicity conditions that imply the fixation of one allele. The genotype dynamics is called Haldane monotone if the relative frequency of one allele strictly increases along all solutions of the genotype dynamics, so this allele is fixed. In this paper, we show that the genotype dynamics is Haldane monotone if and only if the right-hand side of the Price equation is always strictly positive. The other strict monotonicity condition requires that the relative frequency of a homozygote strictly increase according to the genotype dynamics. For example, in a model where the genotype dynamics is governed by interactions between individuals, the cost-accepting homozygote is fixed by natural selection if the other genotypes always receive a smaller average gain from all interactions than the cost-accepting homozygote. Both monotonicity conditions require that the interaction is not well-mixed in the population. These two conditions are not equivalent. In addition, we give a non-monotonicity condition, which also implies the fixation of a homozygote. The fixation of a homozygote depends on the phenotypic payoff of the interaction, the genotype-phenotype mapping, and the interaction scheme. In a sexual population, the interaction scheme of siblings depends on the mating system, and so do the conditions of fixation of the cost-accepting homozygote. We present examples showing that if we only change the monogamous mating system, assuming panmixing or mating assortativity, then the condition for the fixation of the cooperator homozygote is b > 2c and b > c, respectively.
Ahammed, F.
Show abstract
Fraud in the health landscape is an aggravating issue, with far-reaching consequences burdening the financial stability of the health industry and threatening the quality of medical care. It results from vulnerabilities within the current healthcare framework that are exploited by the fraudsters in their favor. In spite of many developed models that aim to detect fraudulent patterns in insurance claims, the accuracy of such models frequently suffers as a result of the imbalance issue of the Medicare dataset and irrelevant features. This study ventures to improve detection performance and accuracy by employing a deep learning model along with data sampling and feature selection techniques. Comparative analysis among different combinations is conducted to determine their efficacy to enhance the accuracy of the fraud detection model. Hence, the suggested model clearly demonstrates that a combination of myriad data sampling and feature selection techniques is helping to improve accuracy and performance. The accuracy was thus 95.4%, with negligible evidence of overfitting detected using both Chi-square and Synthetic Minority Over-sampling (SMOTE) techniques. Ultimately, the study findings underscore the significance of employing combined techniques instead of using only the baseline deep learning model for better performance in detecting Medicare insurance fraud.
Hwang, C.-K.; Chen, Y.-W.; WANG, Y.-T.; Ho, T.-S.; Oyang, Y.-J.
Show abstract
BackgroundDengue has been a major health threat globally in recent years. In particular, dengue incidences continue to increase annually and the epidemic area has expanded primarily due to global warming. Therefore, effective case detection and surveillance strategies are crucial to tackle this global health challenge. In clinical practice, the rapid test kit detecting dengue non-structural protein 1 antigen and commonly referred as NS1, is widely employed for early diagnosis. However, real-world studies revealed that the sensitivity of the NS1 test kit ranged from approximately 61% to 95%. Since early diagnosis is really critical for disease surveillance in the early stage of a dengue epidemic, scientists have been working hard to develop novel diagnosis methods that can provide higher sensitivity levels. Methodology/Principal FindingsIn response to this challenge, in this study, we have developed a novel diagnosis procedure that integrates machine learning technologies with the NS1 test kit. Our experimental results revealed that we would be able to raise the sensitivity of the dengue diagnosis procedure to higher than 99% by incorporating machine learning based prediction models to screen the suspected patients with a negative NS1 result. Furthermore, the relative risks between the suspected patients who were predicted to be positive and those who were predicted to be negative exceeded 4.8. Conclusions/SignificanceThese results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to address the global health challenge caused by spread of dengue. Author SummaryThis study has aimed to enhance surveillance of the dengue disease by integrating machine learning technologies with the rapid test kit commonly employed in early diagnosis. In clinical practice, the NS1 rapid test kit is widely employed for early diagnosis. However, real-world studies revealed that a certain percentage of the patients with a negative NS1 test result, ranging from 5% to 39%, were actually infected by dengue. Since early diagnosis is critical for disease control in the early stage of a dengue epidemic, scientists have been working hard to tackle this challenge. Based on this observation, this study was launched to investigate the effects of incorporating machine learning based prediction models to further screen those patients with a negative NS1 test result. The experimental results revealed that the proposed approach was able to identify over 99% of the patients who were infected by the dengue disease. Furthermore, the risk of the suspected patients who were predicted to be positive was 4.8 times higher than the risk of those who were predicted to be negative. The experimental results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to enhance surveillance of the dengue disease.
Jaeger, K. H.; Tveito, A.
Show abstract
The Poisson-Nernst-Planck (PNP) system is an accurate model of electrodiffusion of ionic species. It is commonly used in situations where nanoscale resolution is required, for instance close to ion channels in the membranes of biological cells. The inherent stiffness of the equations has made them challenging to solve and has limited the applicability of the system. In particular, the time step required for stable solutions has typically needed to be very short (nanoseconds), which makes simulations on the time scale of an action potential (milliseconds) difficult. Recently, it has been observed that avoiding operator splitting and instead solving the concentration equations and the electrostatic equation in a coupled manner relaxes the time-step limitation considerably. However, no theoretical explanation of this observation has been provided. Here, we aim to explain why the coupled scheme allows much larger time steps. We illustrate the mechanism by considering special cases that define necessary, but not sufficient, conditions for stability. We also show that these conditions remain relevant for the fully coupled PNP model in 3D.
Khan, H.; Garcia-Galindo, P.; Ahnert, S. E.; Dingle, K.
Show abstract
A morphospace is an abstract space of theoretically possible biological traits, shapes, or property values. It is interesting to explore which parts of a morphospace life occupies, as compared to those parts which could be occupied, but are not. Comparing random and natural non-coding (nc) RNA secondary structures is an established approach to studying morphospace occupation for RNA structures. Most earlier studies have focused on the minimum free energy (MFE) structure, while relatively few have looked at the Boltzmann distribution, describing the ensemble of energetically suboptimal RNA folds. These suboptimal structures may have important roles and functions, and hence should be examined carefully. Here we compare random and natural ncRNA in terms of their Boltzmann distributions, finding that natural RNA tend to have very similar profiles to random RNA, with the main difference being that natural RNA are slightly more energetically stable, except for very short sequences (20 to 30 nucleotides) which tend to be slightly less stable. We infer that natural ncRNA occupy similar parts of the morphospace that random RNA do, indicating that the biophysics of the genotype-phenotype map largely determines the ensemble properties of ncRNA.
Chiu, C.; Jawaid, M. Z.; Cox, D. L.
Show abstract
Background/ObjectivesThe unprecedented structural and binding data for antibodies to the SARS-COV2 virus taken together with the mutations for the spike protein allows for a broad simulation study of antibody-spike protein binding. This provides an understanding of the co-evolution of human immunity and viral immunity escape. MethodsWe utilized the YASARA molecular dynamics program to generate initial antibody-spike structures and simulate to equilibration for six SARS-COV2 variants and 10 different antibodies sampling two different binding regions to the receptor binding domain of the spike (especially for the Class I antibodies in the same part of the spike which attaches to the ACE2 receptor protein) and one to the N-terminal of the spike. Starting structures for antibody binding to variant spike proteins are perturbatively achieved through point mutations and insertions/deletions in the YASARA program. We employed YASARA to measure interfacial hydrogen bound counts between antibodies and variant spike proteins, and the HawkDock MMGBSA program to characterize trends in binding energies with mutation for four of the antibodies. We utilized the VMD program to analyze the time course of hydrogen bond populations. ResultsAs seen in previous studies, interfacial hydrogen bond counts serve as an excellent proxy for binding energies without the large systematic error inherent in the latter. We find that there is generally a decline in antibody binding strength, as measured by interfacial hydrogen bond counts, with viral evolution, but that a modest re-entrance of binding strength is present for most antibodies studied. Generically, the antibody heavy chain binds more strongly to the spike protein, through for approximately half the antibodies the light chain binding strength converges to the heavy chain strength with viral evolution. ConclusionsThe key conclusion is that the identified re-entrant immunity, speculatively arising from a balancing of maintenance of ACE2-spike binding while escaping antibodies through mutation, allows for some maintenance and even strengthening of immunity for later viral strains from early infection or vaccination.
Wieners, L.; Garcia, M. E.
Show abstract
The light absorption of the protein cryptochrome and its chromophore FAD is important for the regulation of circadian rhythms and in some species for sensing magnetic fields. To compute the absorption spectrum of chromophore, typically only a small region is treated quantum-mechanically due the high computational cost of spectroscopic calculations. We present a formalism that allows a quantum-mechanical treatment of not only the chromophore but also the neighbouring amino acids which differ from species to species. This is achieved by using the real-time time-dependent Hartree-Fock method. This method allows extending the quantum domain from typically only a few dozen atoms up to around 1,200 atoms for the largest calculations. The presented framework allows the treatment of neighbouring tryptophan residues or the cofactor molecule MTHF in the same calculation and allows to extract information of which regions absorb light depending on wavelength. The presented results also show that the environment around the chromophore FAD amplifies the light absorption in cryptochrome.
Cresson, J.; Pere, M.; Szafranska, A.
Show abstract
This work focuses on the global and partial identification problem for fractional differential equations. We provide a general numerical procedure based on global and local optimization algorithms with two refinements for biological systems that ensure solution positivity and homogeneous parameter units. The method is applied to a new fractional model of Dengue outbreak called the Fractional Homogeneous Nishiura (FHN) model, calibrated using data of newly infected people in Cape Verde. We show that our identification method yields a better fit between data and model solutions than previous approaches and that our FHN model captures the dynamics of Dengue more closely than existing systems.
Deng, F.; Li, H.; Sun, D.; Duan, G.; Sun, Z.; Xue, G.
Show abstract
High level of protein expression is usually welcomed in industry and research, and codon optimization is widely used to achieve high expression. Methods of implementing codon optimization can be divided into two branches, one is classical methods which develop cost functions based on empirical law, another is AI methods which learn the codon choice principles from endogenous genes with neural networks. Here we develop two codon optimization tools based on two branches respectively, namely OptimWiz 2.1 and OptimWiz 3.0. Results of fusion protein fluorescence detection indicate that both OptimWiz 2.1 and OptimWiz 3.0 are superior to all the other commercially available codon optimization tools. Principles of codon optimization are revealed in the process of machine learning on both tools.
Diaz Ochoa, J. G.; Puskaric, M.; Layer, N.; Jensch, A.; Knott, M.; Krohn, A.
Show abstract
Graph-based methods for data representation and analysis are well suited for encoding both data points and their interrelationships. This approach integrates data and topology, enabling the representation of interrelated information. In this study, we represent patient cohorts as cohort graphs and discuss their application for real-world patient data. We particularly focus on developing methods to cluster patients with similar symptoms and examine how bias parameters (such as sex and age group) influence interlinking within CGs, thereby improving results for accurate patient stratification and personalized decision-making in a clinical context. In particular we illustrate how by considering sex and age groups we can improve the symptom-clustering of a patient population with lung and gastro-intestinal cancer. Finally, we discuss the essential role of high-performance computing (HPC) in upscaling analytical methods for CGs.
Sajjad, M.
Show abstract
Artificial intelligence (AI) tools have been rapidly adopted by medical researchers, yet whether early career researchers in low and middle income countries possess the awareness and habits needed to use these tools safely remains poorly documented. This study characterized AI adoption patterns, hallucination awareness, and verification and disclosure practices among early career medical researchers in Pakistan. A cross sectional anonymous online survey was conducted among medical students, house officers, residents, physicians, and faculty involved in research or academic work across Pakistan (May 2026). Descriptive statistics and chi square tests were applied to 373 eligible responses. AI use was near universal (99.7%), with 60.3% using AI tools daily. The most commonly reported tool in this sample was Claude (40.5%), followed by ChatGPT (29.2%) and Perplexity (26.0%), though this ranking likely reflects sampling characteristics. Despite high adoption, 59.2% typically did not verify AI outputs before use, and 40.2% had never heard that AI can generate fabricated scientific references. In behavioral vignettes, 36.5% assumed convincing AI generated references were authentic, and 54.2% would continue using remaining AI content after discovering one fabricated reference. Formal research training was strongly associated with consistent disclosure (51.7% vs. 17.1%; chi square=48.43, p less than 0.001). Role, daily use frequency, and research training were not significantly associated with verification behavior. Early career medical researchers in Pakistan demonstrate high AI adoption alongside incomplete hallucination awareness and infrequent verification, a pattern that may carry implications for research integrity. Formal training was the only factor significantly associated with consistent disclosure. Integration of AI literacy into medical curricula and institutional governance frameworks merits consideration.
Hernandez Vargas, E. A.
Show abstract
Evolutionary therapies regulate heterogeneous populations by altering selective pressures through treatment sequences in cancer and infections. This letter develops an invariant-set framework for treatment-induced containment based on positive triangular invariant sets. For periodically switched systems, sufficient conditions are derived for the existence of such invariant regions. Robustness with respect to mutation is established by showing that the invariant simplex persists under small perturbations of the subsystem matrices. In the two-phenotype case, the analysis yields an explicit mutation threshold that separates regimes in which therapy cycling maintains containment from regimes in which mutation can enable evolutionary escape. Simulations illustrate the geometry of the invariant sets and the role of mutation and dwell time in containment robustness.
Pubal, K.; Kushnir, K.; Spiwok, V.; Louzecka, K.; Setnicka, V.; Lipovova, P.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWProteins are built from 20 canonical amino acids. It is interesting to explore whether proteins can be formed from significantly reduced amino acid alphabets. Our bioinformatics survey of UniProt (more than 250 M sequences) revealed that proteins composed of reduced amino acid alphabets (< 10) are extremely rare among existing proteins. Next, we used computational protein design to design proteins composed of all 1,013 possible alphabets of 2-10 early amino acids (Ala, Asp, Glu, Gly, Ile, Leu, Pro, Ser, Thr, and Val). The length of all proteins was 100 amino acid residues. Small amino acid alphabets preferred simple helices or helix bundles. Larger amino acid alphabets allowed for the design of more complex structures. A protein composed of 8 amino acids (Ala, Asp, Gly, Leu, Val, Ser, Thr, and Pro) was successfully experimentally verified. It belongs to fibronectin type III domain {beta}-sheet-rich architecture. Attempts to experimentally verify designs composed of 6 and 4 amino acids were unsuccessful. We show by a computational experiment without an experimental validation that inverse folding programs, namely ProteinMPNN, can stabilize designed proteins within the same amino acid alphabet. Our results show that globular proteins may have formed early in evolution. Furthermore, we show that it is possible to design proteins with interesting properties for biotechnology and synthetic biology.
Kuznetsov, A. V.
Show abstract
AA amyloidosis is a severe complication of chronic inflammatory diseases characterized by fibrillar protein deposition in the kidneys, leading to progressive organ failure. This study presents a mathematical model coupling SAA-HDL binding dynamics with renal amyloid aggregation kinetics to elucidate disease pathogenesis. Under normal conditions, Serum Amyloid A (SAA) circulates bound to high-density lipoprotein (HDL), which acts as a molecular chaperone preventing misfolding. However, during chronic inflammation, SAA production exceeds HDL binding capacity, resulting in free SAA that undergoes renal filtration. The model calculates free SAA concentration from reversible binding equilibrium and incorporates renal filtration, mesangial accumulation, and conversion to amyloid fibrils through primary nucleation and autocatalytic growth mechanisms. A central contribution of this work is quantifying accumulated nephrotoxicity arising from AA oligomers, which inflict cumulative cytotoxic damage to mesangial and tubular cells over time. Because oligomers are continuously generated during ongoing aggregation, their toxic burden integrates across the entire duration of the disease. Combined nephrotoxicity, encompassing both oligomer-mediated cellular injury and fibril-driven mechanical disruption of renal architecture, therefore reflects not merely the current disease state but the full inflammatory trajectory of the patient. This cumulative damage defines renal biological age, a measure of functional deterioration whose portion attributable to accumulated nephrotoxicity is irreversible. Renal biological age is also path-dependent: two patients with identical present-day SAA levels may carry different renal damage burdens depending on the duration, timing, and severity of their prior inflammatory episodes. Sensitivity analysis reveals that HDL concentration and SAA cleavage rate are critical determinants of amyloid burden.
Chu, A. L.; Chu, B. S. L.; Qiang, W.
Show abstract
Formation of the {beta}-amyloid (A{beta}) plaques is a pathological hallmark of Alzheimers disease (AD), and is believed to be a primary cause of dementia in elderly individuals. In the present work, we simulated the conformational evolution of A{beta}42 dimers in solution and in membrane-like environment to explore the folding of A{beta}42 along fibrillation. The molecular dynamics (MD) simulation was steered by experimental internuclear distance restraints obtained using solid-state nuclear magnetic resonance (ssNMR) spectroscopy. Our results revealed that several hydrophobic and polar motifs within the A{beta}42 sequence played key roles in the early-stage nucleation process of fibrillation and those motifs are also the stabilizing agents in the mature fibrils judged by the energy contribution. Our results also indicated that the peptide association with membrane bilayers could modulate the structural evolution pathways towards fibrillation. These findings contributed to a better understanding of the molecular level structural polymorphisms inherent to A{beta}42 fibrils. Further, the current work demonstrated that the combination of MD simulations with ssNMR-based experimental restraints provided a reliable method for studying structural changes of A{beta}. HighlightO_LIUsing solid-state NMR restraints guided molecular dynamic simulation, {beta}-amyloid dimers displayed consistent {beta}-strand-prone regions, which are major stabilizing segments for mature fibrils. C_LIO_LI{beta}-amyloid dimers evolved differently with or without interacting with the lipid bilayers. C_LIO_LIExperimental restraints guided simulation provided molecular level insights about early-stage interactions along the progress of {beta}-amyloid fibrillation C_LI